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Abstract 

We show that the probabihty that a finitely supported random walk on a non-elementary sub- 
group of the the mapping class group gives a non-pseudo-Anosov element decays exponentially 
in the length of the random walk. More generally, we show that if i? is a set of mapping class 
group elements with an upper bound on their translation lengths on the complex of curves, then 
the probability that a random walk lies in R decays exponentially in the length of the random 
walk. 
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1 Introduction 

Let E be a compact oriented surface of finite type, and let G be the mapping class group of E, i.e. 
the group of orientation preserving homeomorphisms of E, up to isotopy. Let /z be a probability 
distribution on G with finite support. A random walk on G is a Markov chain with transition 
probabilities p{x,y) = iJ,{x~^y). We will always assume we start at the identity at time zero, and 
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we will write Wn for the location of the randora walk at time n. The probability distribution /j, 
need not be symmetric, but we shall always assume that the semi-group generated by the support 
of is a non-elementary subgroup of the mapping class group. A subgroup of the mapping class 
group is non- elementary if it contains a pair of pseudo-Anosov elements with distinct fixed points 
in VMF, Thurston's boundary for the mapping class group. Rivin [Riv08[|Riv09| and Kowalski 
[KowOSj showed that the probability that a random walk on the mapping class group gives rise to a 
pseudo-Anosov element tends to one, as long as the group generated by the support of the mapping 
class group maps onto a sufficiently large subgroup of Sp{2g, Z). Furthermore, they showed that the 
probability that an element is not pseudo-Anosov decays exponentially in the length of the random 
walk. Malestein and Souto jMSllj and Lubotzky and Meiri jLMllj extended this to the Torelli 
subgroup, by considering the action of the Torelli group on the homology of double covers of the 
surface. 

In jMahllj it was shown that the probability that a random walk gives a pseudo-Anosov element 
tends to one for all non-elementary subgroups of the mapping class, by considering the action of the 
mapping class group on the complex of curves, but no information was obtained about the rate of 
convergence. In this paper we show that the rate of convergence is exponential; in fact, wc show 
that for any constant S, the probability that a random walk gives an element of translation length 
at most B on the complex of curves decays exponentially in the length of the random walk; the rate 
of decay depends on B. 

We say the surface S is sporadic if S is a sphere with at most four punctures, or a torus with 
at most one puncture. The complex of curves C(S]) is a simplicial complex, whose vertices consist 
of isotopy classes of simple closed curves, and whose simplices are spanned by disjoint simple closed 
curves. The mapping class group G acts by simplicial isometrics on the complex of curves, and 
Masur and Minsky [MM99j showed that an element is pseudo-Anosov if and only if its translation 
length on C(S]) is positive. We will use Landau's "big O" notation, so 0{g{x)) denotes some function 
f{x) such that f{x) ^ C \g{x)\ for some constant C > 0, and for all x sufficiently large. 

Theorem 1.1. Let G be the mapping class group of a non-sporadic surface of finite type, and let Wn 
be a random walk of length n on G generated by a finitely supported probability distribution ^, whose 
support generates a non- elementary subgroup of the mapping class group. Then for any constant 
B > 0, there is a constant c < 1 such that 

P(tK) S) ^ 0(c"), 

where T{wn) is the translation length of Wn acting on the complex of curves. 

If the surface is sporadic, then the mapping class group is cither finite, or commensurable to 
SL{2, Z), and in the latter case the result follows from the work of Rivin |Riv08[[Riv09| or Kowalski 
[Kow08| on random walks on matrix groups. Theorem 11.11 docs not apply to the Torelli group 
of the genus two surface, as this group is not finitely generated, as shown by McCuUough and 
Miller |MM86| . However, the results of |Mahll| hold in this case, but with no rate of convergence 
information. 

There are two main steps, both of which use the improper metric on G arising from its action on 
the complex of curves, which we shall denote by d{g, h) = dccs2)igxo,hxo), where xq is a basepoint 
in the complex of curves C{T,); this is also known as a relative metric on G. The first is to show 
that the random walk has a linear rate of escape in the relative metric, with exponential decay 
for the proportion of sample paths making progress at lower rate. The second is to consider the 



2 



distribution of elements of bounded translation length on the complex of curves. If g is an element 
of bounded translation length on the complex of curves, then g is conjugate to an element s, of 
bounded relative length. Furthermore, if v is chosen to be a shortest conjugating element, then the 
path vsv~^ is quasigcodesic, with quasigeodesic constants depending only on G and the bound on 
translation length. This means that if a random walk Wn is conjugate to an element of bounded 
translation length, then if the first half of a geodesic from 1 to w„ fellow travels with some geodesic 
from 1 to V, then the second half of the geodesic from 1 to u)„ fellow travels with a translate of a 
geodesic from 1 to v~^. This fellow travelling condition is equivalent to the condition that the pair 
(w„, lies in a certain neighbourhood of the diagonal in G x G, and we show that the probability 
that this occurs decays exponentially in the length of Wn- 

We now give a brief summary of the organization of the paper. The remainder of this section 
is devoted to a detailed outline of the argument described in the previous paragraph. In Section 
[2] we introduce some standard definitions and fix notation. In particular, we define subsets of G, 
called shadows, and find upper bounds for the probability that a random walk lies in a shadow. In 
Section [3] we show the linear progress result, with an exponential decay bound for the proportion 
of paths making linear progress below some rate. Finally, in Section 2] we show that the fellow 
travelling property described above is equivalent to the condition that the pair {wn,w~^) lies in a 
certain neighbourhood of the diagonal in G x G, consisting of unions of shadows, and we show that 
the probability that a random walk lies in one of these neighbourhoods decays exponentially in the 
length of Wn- 

1.1 Outline 

We will consider the action of the mapping class group on the complex of curves, see Farb and 
Margalit jFMj for an introduction to the mapping class group. The complex of curves C(E) is a 
simplicial complex, whose vertices are isotopy classes of simple closed curves, and whose simpliccs 
are spanned by disjoint simple closed curves. The complex of curves is finite dimensional, but not 
locally finite. We will only need to consider distances between vertices in the curve complex, and so 
we consider the 1-skclcton of the complex of curves to be a metric space (C(S]), (ic(i;))i by assigning 
every edge to have length 1. By abuse of notation, we will refer to this as a metric on the curve 
complex. The mapping class group acts on the complex of curves by simplicial isometrics, and 
a choice of basepoint xq in the complex of curves determines a map from G to C{T,) defined by 
g I— > g{xQ). We may therefore define an improper metric on the mapping class group by 

d{g,h) = dc{^^{gxo,hxo). 

We emphasize that throughout this paper the metric d will always refer to this improper metric 
induced from the action of the mapping class group on the complex of curves, and never a proper 
word metric on G with respect to a finite generating set. However, the metric d is quasi-isometric 
to a word metric on G with respect to an infinite generating set, also known as a relative metric, 
formed by starting with a finite generating set and adding subgroups which stabilize vertices in the 
complex of curves which correspond to distinct orbits under the action of the mapping class group. 
Masur and Minsky |MM99| showed that the curve complex is Gromov hyperbolic, and wc will write 
dG for the Gromov boundary of G, and G for G U dG. 

A random walk of length ri on G is a product of n independent identically /^-distributed random 
variables Si, which we shall call the steps of the random walk, so Wn = siS2 ■ ■ ■ Sn, and Wn is 
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distributed as fin, the n-fold convolution of /x with itself. A random walk converges to the Gromov 
boundary almost surely, so this gives a hitting measure, known as harmonic measure on 9G, and 
which wc shall denote by ly. We will need to estimate the probability that a random walk lies in 
particular subsets of G. We now define a family of subsets of G which we shall call shadows. Recall 
that the Gromov product of x and y with respect to 1 is equal to the distance from 1 to a geodesic 
from X to y, up to a bounded error which only depends on 6. Given a real number r, we can use the 
Gromov product to define the shadow of a point x in G, which wc shall denote by Si{x,r), 

Si{x,r)={yeG\ix-y)i^r}. 



Si{g,r) 



Figure 1: A shadow of a point. 

We warn the reader that we use a different parameterization of shadows than that used by other 
authors, for example, Blachere, Haissinsky, and Mathicu, [BHMOSj define their shadows Ui{x,r) to 
be Si{x, d{l, x) — r) D dG in our notation. We show that both the harmonic measures of shadows, 
and the /i„-measures of shadows, decay exponentially in r, i.e. there are constants K and c < 1 such 
that h'{Si{x,r)) ^ and /i„(S'i (x, r)) ^ Kd, for all x, r and n. 

In |MahlO] . we showed that a random walk makes linear progress in the relative metric, almost 
surely, i.e. there is a constant L > such that 

P((i(l, Wn) ^ Ln) — !• as n — oo. 

We need a stronger version of this result, which gives an exponential decay bound for the rate of 
convergence. To be precise, wc show: 

Theorem 1.2. Let G be the mapping class group of a non-sporadic surface of finite type, and let Wn 
be a random walk of length n on G, generated by a finitely supported probability distribution fi, whose 
support generates a non- elementary subgroup of the mapping class group. Then there are constants 
L > and c < 1, such that 

P(d(l,w„) < Ln) 0(c"), 

where d is the non-proper metric on the mapping class group arising from its action on the complex 
of curves. 

We now give a brief overview of the proof of Theorem 11.21 Consider taking the random walk k 
steps at a time, i.e. consider Wnk instead of w„, which we shall refer to as the fc- iterated random 
walk. We shall write for Wnk, and the steps of the fc- iterated random walk are given by = 
Skn-kSkn-k+i ■ ■ ■ Skn- The increments of the walk, s^, are all independent and identically distributed, 
with distribution fik. However, the distance travelled at time nk, given by d(l,u)^j) is not the sum 
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of the distances travelled at each step of the fc-iterated walk, d{wj^,w'^^i), as there may be some 
"backtracking" , illustrated schematically in Figure [2] 



Wo 



Wo 



Figure 2: Steps of the iterated random walk. 

The distance w^) is the sum of the d{w'^_i,w^) for i ^ n, minus the total amount of back- 
tracking. A key fact is that the distribution of the amount of backtracking at time i is bounded 
above by an exponential function, and furthermore, the same upper bound holds for all i, indepen- 
dent of the locations of the random walk, or the amount of backtracking at other times, and also 
independent of fc, the number of steps for each segment of the fc-iterated random walk. We now 
explain why this is the case. The amount of backtracking can be estimated as follows. The size of 
the backtrack from to is roughly the distance from wf.^ to the geodesic from 1 to . After 
applying the isomctry w^_i, this is the same as the distance from 1 to the geodesic from 
to {wf_i)~^w^ , as illustrated in Figure [31 

iwl^-^w'^ = 



(wl^-' 1 



Figure 3: A single backtrack, after the isometry w^_^. 

The point {■w^_i)^^w'l is equal to s^, and the pair of random variables {{w'i;_i)~^, s^) are indepen- 
dent, and distributed as Jlk{i-i) ^ Mfcj where pt„ is the n-fold convolution of the reflected distribution 
]l{g) = iJ.{g^^). If the backtrack is of length at least r, then distance from 1 to a geodesic from 
{wi_i)~^ to is at least r, up to bounded error, so in turn the Gromov product {{w^_i)^^ ■ sf)i 
is at least r up to bounded error. Therefore s*^ lies in Si{{w^_i)^^ ,r — K), for some K which only 
depends on S. We show that both the harmonic measure and the convolution measures /i„, of 
shadows 6*1(3:;, r) are bounded above by a function which decays exponentially in r, and furthermore, 
the upper bound function is independent of both x and n. Therefore, the probability that there 
is a backtrack of size r decays exponentially in r, independently of k, and also independently of 
the locations of the random walk at other times. In particular, the expected size of a backtrack is 
bounded independently of k, so by choosing k sufhciently large, we can ensure that the expected 
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value of each fc-iterated step d{wf_i,w'^) is larger than the expected value of a backtrack. Further- 
more, applying standard Bernstein or Chernoff-Hoeffding estimates for concentration of measures, 
we obtain bounds for the probability that the sums of the first n backtracks and fc-iteratcd steps 
deviate from their expected values, and these bounds decay exponentially in n. This implies that the 
distance away from the origin grows linearly at some rate, with exponential decay for the proportion 
of paths making progress below this rate. 

We now wish to show that the probability that Wn is pscudo-Anosov tends to 1 exponentially 
quickly. The translation length of a group element on the complex of curves is 

T(g)= hm 

Masur and Minsky |MM99| showed that the pseudo-Anosov elements are precisely those elements 
with non-zero translation length. The translation length of an element g acting on the complex 
of curves is also coarsely equivalent to the shortest length of any conjugate of g, measured in the 
relative metric on the mapping class group. In [Mahllj we showed that the mapping class group 
has relative conjugacy bounds^ i.e. there is a constant K such that if two group elements a and b are 
conjugate, then a = vbv~^ for some element v whose length is bounded in terms of the lengths of a 
and b, 

d{l,v) < K{d{l,a) +d{l,b)). 

We emphasize that the distance d here is the relative or curve complex distance on the mapping 
class group. Every group element g corresponds to a point in G, but we may also think of g as 
representing some choice of geodesic from 1 to g. We may therefore think of a product of group 
elements as representing a path in G, composed of concatenating various translates of geodesies 
representing each element in the product. In particular, the word vbv^^ corresponds to a path 
consisting of three geodesic segments. As the curve complex is J-hyperbolic, one may show that if v 
is chosen to be a conjugating word of shortest relative length, then the path vbv~^ is a quasigeodesic 
path in the curve complex, where the quasigeodesic constants depend on the relative conjugacy 
bound constant K, and the length of b. 

If we choose i? to be a collection of group elements of conjugacy length at most B, then every 
element g G i? is equal to vsv~^, where d(l, s) ^ B, and the paths vsv~^ are uniformly quasigeodesic 
over all elements of R. This implies that the first half of the geodesic from 1 to 5 fellow travels with 
a translate of the inverse of the second half of the geodesic from 1 to Wn- In order to find an upper 
bound on the probability that this occurs, it is convenient to express this fellow travelling property 
in terms of the location of the pair {g,g~^) in G x G. The fact that vsv~^ is quasigeodesic implies 
that g £ Si{v,r) and g^^ S Si{v,r), where r is equal to ^d{l,g), up to an additive error which 
only depends on S and the quasigeodesic constants. We may extend the definitions of shadows to 
subsets [/ C G by setting Si{U, r) to be the union of all Si{g, r) over all points g € U. We may then 
extend the definition of shadows to subsets C/ C G x G, by setting Si{U,r) to be the union of all 
Si (51 yf)^Si (52 , f) , over all (gi ,52) G U . In particular, if a random walk w„ lies in R, then the pair 
{wn,w~^) lies in 6*1 (A, r), a shadow of the diagonal A in G x G, where r is roughly ^^(l, By the 
linear progress result, we may assume that r grows linearly in n, up to a set of paths whose measure 
decays exponentially in n. The distribution of pairs {wn,w~^) is obviously not independent, as Wn 
determines w~^, but they are asymptotically independent, and converge to v xv. In fact, we may 
approximate the distribution of pairs {'W2n,W2n) by the distribution of pairs This is 

because as sample paths converge to the boundary almost surely, it is probable that the the point 
Wn looks close to the point W2n, as viewed from the origin 1, as illustrated in Figure |4] below. 
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Figure 4: A path of length 2n. 



Similarly, standing at W2n and looking back towards the origin 1, the point 1 looks close to the 
midpoint of the path w„. If we apply the isometry w^„^, this implies that and W2nWn look close 
together when viewed from 1. We can make this precise, and we show that the probability that W2n 
lies in the shadow Si{wn,d{l,Wn) — K) tends to one exponentially quickly, for some K which only 
depends on the constant of hyperbolicity S. The same argument shows that the probability 
lies in 5i(u'^„^z«„, w^„^it;„) — K) tends to one exponentially quickly. The pair {wmW2nU)n) is 
independent, and distributed as /i„ x Jin- We may then use the fact that the measure for shadows 
of points decays exponentially in r to show that the fi„ x /!„ measure of a shadow of the diagonal 
in G X G also decays exponentially in As r grows linearly in ri, this shows that the probability 
Wn has bounded translation length decays exponentially in n. 
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2 Preliminaries 
2.1 Random walks 

We now review some background on random walks on groups, see for example Woess [WoeOO] . Let 
G be the mapping class group of an orientable surface of finite type, which is not a sphere with 
three or fewer punctures, and let fi he a probability distribution on G. We may use the probability 
distribution /i to generate a Markov chain, or random walk on G, with transition probabilities 
p{x,y) = ^(x~^y). We shall always assume that we start at time zero at the identity element of the 
group. The step space for the random walk is the product probability space (G, /^)^+ , and we shall 
write (si, S2, ■ • ■ ) for an element of the step space. The Si are a sequence of independent, identically 
/i-distributed random variables, which we shall refer to as the increments of the random walk. The 
location of the random walk at time n is given by Wn = S1S2 . . . s„, and so the distribution of random 
walks at time n is given by the n-fold convolution of /i, which we shall write as The path space 
for the random walk is the probability space (G^+,P), where G^+ is the set of all infinite sequences 
of elements G, and the the measure P is induced by the map (si, S2, . . ■) 1— > (wi, W2, ■ • •)• 

We shall always require that the group generated by the support of is non- elementary, which 
means that it contains a pair of pseudo-Anosov elements with distinct fixed points in VMJ-. We do 
not assume that the probability distribution /x is symmetric, so the group generated by the support 
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of /i may be strictly larger than the semi-group generated by the support of ^. Throughout this 
paper we will need to assume that the probability distribution ^ has finite support. 

In [Mahllj . we showed that it followed from results of Kaimanovich and Masur [KM96j and 
Klarreich |Kla| . that a sample path converges almost surely to a uniquely ergodic, and hence minimal, 
foliation in the Gromov boundary of the relative space. This gives a measure v on Jvim, known as 
harmonic measure. The harmonic measure v is /^-stationary, i.e. 

v{X) = Y,^,{g)v{g-^X). 

Theorem 2.1. jKMOGlfKlallMahllj Consider a random walk on the mapping class group of an 
orientable surface of finite type, which is not a sphere with three or fewer punctures, determined 
by a probability distribution fjL such that the group generated by the support of pL is non- elementary. 
Then a sample path converges to a uniquely ergodic foliation in the Gromov boundary J-min of 

the relative space G almost surely, and the distribution of limit points on the boundary is given by a 
unique pi-stationary measure v on Train- 

It will also be convenient to consider the reflected random walk, which is the random walk 
generated by the reflected measure /i, where = i-i{g^^). We will write v for the corresponding 
pi-stationary harmonic measure on 

2.2 Coarse geometry 

We briefly recall some useful facts about Gromov hyperbolic or J-hyperbolic spaces, and fix some 
notation. A (5-hyperbolic space is a geodesic metric space which satisfies a d-slim triangles con- 
dition, i.e. there is a constant d such that for every geodesic triangle, any side is contained in a 
(5-neighbourhood of the other two. Let (G, d) be a J-hyperbolic space, which need not be proper. 
We shall write dG for the Gromov boundary of G, and let G = G U dG. Given a subset X C G, we 
shall write X for the closure of X in G. Given a point z G G, the Gromov product based at z is 
defined to be 

{x ■ y)z = l{d{z,x) + d{z,y) - d{x,y)). 
We may extend the definition of the Gromov product to points on the boundary by 

(x • y)z = supliminf(xi • yj)^ 

where the supremum is taken over all sequences Xi — > x and yj y. This suprcmum is finite unless 
X and y are the same point in dG. 

We will make use of the following properties of the Gromov product, see for example, Bridson 
and Haefliger |BH991 III.H 3.17]. 

Properties 2.2 (Properties of the Gromov product). 

1. The Gromov product [x ■ y)^ is equal to the distance from z to a geodesic from x to y, up to 
a bounded error of at most 5. 

2. For any three points x,y, z G, 

{x ■ y)i > min{(a; • z)i, {y ■ z)i} - 26. 
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3. If y £ dG, then there is a sequence yi ^ y with hm„(a; • = {x ■ y)i. 

4. For any x € G, and for any sequence yi ^ y E dG, 

{x ■ y)i - 2(5 ^ liniinf(2; • yi)i {x ■ y)i. 



We will also use the following stability property of quasi-geodesics in a 5-hyperbolic space. Let / 
be a connected subset of M. A quasi-geodesic is a map j: I G which coarsely preserves distance, 
i.e. there are constants K and c such that 

^\s^t\-c^ d(7(s), 7(<)) ^K\s^t\ + c. 

For every K and c there is a constant L, which depends only on K, c and (5, such that a finite 
(K, c)-quasigeodcsic is Hausdorff distance at most L from a geodesic connecting its cndpoints, see 
Bridson and Haefliger jBH99[ III.H Theorem 1.7]. 

Finally, we will also use the fact that nearest point projection onto a geodesic 7 is coarsely well 
defined, i.e. there is a constant K, which only depends on S, such that if p and q are nearest points 
on 7 to X, then d{p, q) ^ K. Furthermore, if y is a point on a geodesic 7, and x is a point with 
nearest point projection p on 7, then the path consisting of a geodesic from x to p, and then from 
p to 2/ is a bounded Hausdorff distance Ki from a geodesic from x to y, where Ki only depends on 
(5, see for example |MahlO[ Proposition 3.1]. 

2.3 Shadows 

Given a point x e G and a real number r, we define the shadow of x based at 1, written as Si{x^ r), 
to be 

Si{x,r)^{yeG\{x-y)i^r}. 

If a; e G, and r ^ d{l, u) + 2(5, then Si (x, r) is empty. If r ^ 0, then Si (x, r) consists of all of G. 

We warn the reader again that this definition of a shadow differs slightly from that of other 
authors, for example Blachere, Hai'ssinsky, and Mathieu (BHMOSj . who define their shadows U(l,r) 
to be S'l (x, (i(l , x) — r) n 9G, in our notation. We also remark that it is possible to use the Gromov 
product to define a metric on the Gromov boundary, where roughly speaking the distance between 
two boundary points is e~'^'^, where d is the Gromov product of the two points based at 1. In this case, 
the intersection of a shadow with the boundary is a small metric neighbourhood of the boundary 
point. However, we wish our neighbourhoods to include points in G, for which the boundary metric 
is not defined, so we find our definition of shadows more convenient. 

We may extend the definition of shadows from points to arbitrary subsets of G. Given a subset 
U C G, we define the shadow of U based at 1. written Si{U,r), to be the union of the shadows of 
all points of J7, i.e. 

^i(f/,r)= U 5i(x,r). 

Note that if U contains points in G, then in general U <f. Sr{U). However, it is not hard to show 
that Pi Si{U, r) = U n dG, though we will not use this fact directly. 

There is a lower bound on the Gromov product of any two points in the shadow of a single point, 
which we now state as a proposition. This is a direct consequence of Propertv l2. 2121 above. 
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Proposition 2.3. For any y, z £ Si{x, r), the Gromov product {y ■ z)i ^ r — 25. 

Shadows are closed subsets of G, and we now show that a shadow of a point is the closure of its 
intersection with G. 

Proposition 2.4. Si{x,r) = Si{x,r) n G. 

Proof. Suppose yi is a sequence of points in Si{x, r) fl G, and yi y & dG. Then, by the definition 
of a shadow, {x-yi)i 5^ r for all i. This implies that liminf (a;-?/i)i 5^ r, and so sup liminf(x -7/^)1 ^ r. 
Therefore, by the definition of the Gromov product for points in the boundary, (a; • y)i ^ r, and so 
y€Si{x,r). 

Conversely, if y £ S'i(x', r) n dG, then by Propertv 12.2131 there is a sequence yi y with 
lim„(x - yi)! = (x ■y)i, and as the Gromov product takes values in Z, we may pass to a subsequence 
such that {x ■ yi)i = (x • y)i, and so this gives a sequence yi contained in S'i(x,r), which converges 
to y. □ 

We shall write i]d{T) for all points which lie in a metric Z?-neighbourhood of T n G, i.e. 

Vd{T) = {g e G I d{g,t) D for some teTnG}. 

We now show that all points in a metric D-neighbourhood of a shadow of T, are contained in a 
slightly larger shadow of T. 

Proposition 2.5. For any D ^ 0, 

ilDiSi{T,r)) C Si{T,r-D). 

Proof. If g G TjoiSriT)), then there is a point h G Sr{T) with d{g, h) D, and a point t E T with 
{h ■ t)i ^ r. By the definition of the Gromov product, {g ■ t)i ^ {h ■t)i — D which in turn is at least 
r — D, so g e Sr-D{T), as required. □ 

We will use the following properties of shadows of points, which follow from elementary argu- 
ments, see Calegari and Maher [CMlOj for detailed proofs. We state the results using the current 
notation of this paper. 





> A 








SAx,r-A-Ki) 


S,(x, r) 



Figure 5: Sufficiently nested shadows are metrically nested 

Lemma 2.6 (Nested shadows are metrically nested). |CM101 Lemma 4.5] There is a constant K2, 
which only depends on 6, such that for all positive constants A and r, and any x, z € G with d{x, z) ^ 
A + r + 2K2, the shadow Sz{x, r) is disjoint from the complement of the shadow Sz{x,r — A — K2). 
Furthermore for any pair of points a,b £ G such that a G Sz{x, r) and 6 G G \ Sz{x,r — A — K2), 
the distance between a and b is at least A. 
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S,{x, r) 



Figure 6: Changing basepoint for a shadow 

Lemma 2.7 (Change of basepoint for shadows). |CM101 Lemma 4.7] There are constants K3 and 
K^, which only depend on S, such that for any r, and any three points x,y, z G G with {x-y)z ^ r—K^, 
there is an inclusion of shadows, 

Sz{x,r) C Sy{x,s), 

where s ~ d{x, y) — d(x, z) + r — K^. 

Lemma 2.8 (The eomplement of a shadow is approximately a shadow). jCMlOl Lemma 4.6] There 
is a constant K^, which only depends on 5, such that for all constants r ^ K^, and all x, z € G with 
d{x,z) ^r + 2K5, 

Sx{z, d{x, z) — r + Kq) C G \ Sz{x, r) C Sx{z, d{x, z) — r — K5). 

We may further extend the definition of a shadow to subsets of G x G. Let U C G x G, and 
define the shadow Si{U,r) to be 

Si{U,r)= U S,{x,r) X S,{y,r). 

We shall continue to write Si{U,r) for the shadow in this case. Hopefully this will not cause 
confusion, as it should be clear from context whether T is a subset of G or G x G. 

Finally, we remark that the lower bound for the Gromov product in a shadow, Proposition 12. 3[ 
immediately implies that the r-shadow of an s-shadow is contained in the shadow S'i(T, min{r, s} — 
28). 

Proposition 2.9. Let T he a subset of either G or G x G. Then 

Si (5i (T, s) , r) C Si (T, min{r, s]^25), 

for all r and s. 

2.4 Exponential decay for shadows 

In this section we show the following upper bounds for measures of shadows. 

Lemma 2.10. Let n he a finitely supported prohahility distribution on G whose support generates a 
non- elementary subgroup, and let v he the corresponding harmonic measure. Then there are constants 
Kq, Kj and c < 1, such that for any x with d{l, x) ^ Kq and for any r, 

v{Si{x,r)) < c''. 
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and 

/.i„(5i(x,r)) K-jd'. 

The constants Kq, Kj and c depend on ^ and 6, but not on r or x, as long as d{l^x) Kq. 

Here we write Kjc^ instead of 0(c''), as it will be convenient to know explicitly the dependence 
of the implicit constants in 0{c^). This result also applies to the reflected random walk generated 
by the probability distribution Jl{g) = /i(.g^^), and we may choose the constants to be the same for 
both random walks. 

The proof of this result is essentially the same as the proof of exponential decay of measures 
of halfspaccs from jMahlO] . Shadows arc slightly more general sets than halfspaces, so the shadow 
result is not an immediate consequence of the halfspacc result, although the halfspace result docs 
follow from the version for shadows. Although the shadow version could be deduced from the 
halfspacc version, this still requires extra work, so wc choose to give an argument here purely in 
terms of shadows. Wc start by giving some conditions on a family of nested subsets Xq D Xi Z) ■ ■ ■ 
of G, which guarantee that their measures decay exponentially in the index of the sets. We then 
show that a shadow Si {x, r) is contained in a nested family of shadows satisfying the conditions, 
and furthermore, the number of sets in the nested family is linear in r. 

If A and B are subsets of G, then we define d{A, B), the distance between A and B, to be the 
smallest distance between any pair of points in An G and B D G. If either of these sets is empty, 
the distance is undefined. 

Lemma 2.11. Let ^ be a probability distribution of finite support of diameter D. Let Xq D Xi D 
X2 D ■ ■ ■ be a sequence of nested closed subsets of G with the following properties: 

1 ^ Xo (1) 

x\x^n x,+i = (2) 

d{X\X„X,+i);^D (3) 

Furthermore, suppose there is a constant < e < ^ such that for any x G Xi\ Xi^i, 

VxiX^+2) s; e, (4) 
z/,(X\X,_i) «;e, (5) 

then there are constants c < 1 and K, which only depend on t, such that v{Xi) ^ d and fj,n{Xi) ^ 
Kc\ 

Proof. By properties ((J), ^ and Proposition 12. 4[ any sequence of points which converges into the 
limit set of must contain points in X^+i. As the diameter of the support of fj, is D, property ([3]) 
implies that any sample path which converges into Xi^2 must contain at least one point in Xi\Xi^i. 
Therefore, in order to find an upper bound for the probability a sample converges into Xi+2, given 
that it converges into X^+i, we can condition on the location at which the sample path first hits 
Xi \Xi+i. Let F be the distribution of first hitting times in Xi \Xi+i for those sample paths which 
converge into Xi^2, then, 

F{x)MX,+2) 

v{X^+2) _ xgXAX.+ i 

K^.+i) " ^ F{x)v,{X,+i) 

x£X^\Xi + l 
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For all X S Xi\Xi+i, there is an upper bound i'x{Xi+2) ^ e, by property (|H). As X^+i is contained 
in Xi-i, there is a lower bound v^^Xi+i) ^ (^^(Xi-i), and by property ([5]), ^ 1 — e. 

Therefore 

1-e ^ ' 

as e < ^. Therefore iy{Xi) ^ c\ where we may choose c — e/(l — e). 
The measure v is /i-stationary, and so /i„-stationary for all n, i.e. 

I'iXi) = ^ ^ri{9)Vx{Xi). 

g€G 

As all terms in the sum are positive, we may discard some of the terms and the sum will still be 
bounded above by the upper bound for ly(Xi), i.e. 

^ X! lJ.n{g)vx{Xi). 

geA'i+i\Xi+2 

The measure Vx{Xi) is at least 1 — e by ([S]), which implies 



and we may rewrite this as 



^ (l-e)/i„(X,;+i\X,+2). 

As Xi+1 = Xi+i \ Xi+2 U Xi+2 \ Xi+j, U • ■ • this implies 



1 - e 1 - c 



so UniXi) ^ Kd, where l//v = (1 — e)(l — c). The constant K only depends on e, as c only depends 
on e. □ 

We wish to apply this lemma to shadows of points. We start by showing that as the harmonic 
measure v is non-atomic, the harmonic measure of the shadows of points 5*1 (x,r) tends to zero as r 
tends to infinity, uniformly in x. 

Proposition 2.12. For any e > there is a constant K^, which depends on e and ji, such that if 
r ^ Kf, then v{Si{x,r)) ^ e. 

Proof. Suppose not, then there is an e > 0, and a sequence of shadows Si{xi, ri), with — cxd such 
that i'{Si{xi,ri)) ^ e. Let Un ~ Ui>n Si{xi,ri), and let U ~ f] C/„, so U consists of all points which 
lie in infinitely many r-shadows. The sets Un are decreasing, i.e. C/„ D Un+i, and v{Un) ^ e for all 
77., so ^{U) ^ e, and so in particular U is non-empty. 

Given A g [/, pass to a subsequence, which by abuse of notation we shall still refer to as Sx{xi,ri), 
such that A € Si{xi,ri) for all i . Let yihe any sequence of points with yi g Si{xi,ri). By Proposition 
12. 3i {yi ■ A)i ^ — 2(5 which tends to infinity as 7 — > oo, which implies that yi — >■ A. But this implies 
U ~ {A}, which must have measure zero, as the measure v is non-atomic, which contradicts the fact 
that iy{U) ^ e > 0. □ 
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It will be convenient to choose e < ^, so from now on we will fix a value of Kg, which ensures 
that Proposition 12.121 holds for some e with e < ^ ■ We now complete the proof of Lemma 12.101 by 
showing that a shadow 5*1(2;, r) has a nested family of sets X„ satisfying Lemma [2.111 where the 
number of sets is linear in r. The constant L in Lemma 12.131 depends only on /z and 5, as does the 
choice of constant e from Proposition 12.121 so the constants arising from Lemma [2.111 will depend 
only on ji and 5. 













1 








X 



Xo= X2= X^r= Si{x,r) 

Si{x,L) Si{x,2L) Si{x,LN) 



Figure 7: Nested shadows. 

Lemma 2.13. For any constant D, there is constant L, which depends on fi and 5, with the following 
•properties. For any shadow Si{x,r), with d{l,x) > 2L, let N be the largest integer such that 
N ^ r/L — 2. Then the sets Xn = Si{x,L{n + 1)), for ^ n ^ N, form a sequence of nested sets, 
which contain Si{x,r), and which satisfy properties (1-5) from Lemma \2.11\ above. 

Proof. Let L = D + 2K2 + K3 + + K% + 2(5, where D is the diameter of the support of /z, and 
the constants Ki are the constants from Lemmas 12. 6112. 71 12. 81 and Proposition 12 . 121 respectivelv. We 
may assume that L > 0. The sets X„ = Si{x, Ln) are nested, i.e. Xq D D • • • , by the definition 
of shadows, and Si{x,r) C Xn for n ^ N ^ r/L — 2. We now check properties (1-5) from Lemma 

ll]) The Gromov product (1 • a;)i = 0. For all y £ Xq the Gromov product {x ■ y)i ^ i > 0, so 

([2]) By Property 12.2141 of the Gromov product, for any sequence yi ~^ y £ dG, liminfi(x • yi)i ^ 
{x ■y)i — 26. Therefore, if y G -^^n+i = Si{x, L{n + 2)), then for any sequence yi — > y, all but finitely 
many points lie in Xn = Si{x, L{n + 1)), as L > 2S. Therefore Xn+i H X \ Xn = 0, as required. 

([3]) Two shadows which are sufficiently nested in terms of their shadow parameters, are also 
metrically nested in terms of the distance in G, by Lemma [^TBl We shall apply Lemma [^751 choosing 
the constant ^4 to be D and the constant r to be nL. Recall that L ^ D + 2K2, where D is 
the diameter of the support of fi, and K2 is the constant from Lemma 12.61 This implies that 
d{l,x) ^ D + L{n + 1) + 2K2 for all < n < iV - 1, by our choice of N. Therefore Lemma [2Jl 
implies that d{Si{x, L{n + 1)), G \ Si{x, Ln)) > D, so d{Xn+i,G \ Xn) ^ D, as required. 

(jl]) Suppose that y ^ Xn+i. We wish to show that that Xn+2 is contained in a shadow with 
bascpoint y, with a lower bound on the size of its r-parametcr. This in turn will give an upper 
bound on the harmonic measure of the shadow. We may change the bascpoint for the shadows using 
Lemma [^771 so as long as [x ■ y)i ^ r — K^, Lemma [^771 implies that the shadow Si{x, r) is contained 
in Sy{x, s), where 

s = r + d{x, y) — d{\,x) — K4. 
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As y ^ Xn+i, this implies that {x ■ y)i < L{n + 1). Therefore choosing r = L{n + 2) implies that 
{x ■ y)i < r — L, and as we have chosen L > K3, the conditions of Lemma 12.71 are satisfied. 

Therefore Uy{Sy{x, s)) is an upper bound for Uy{Xn-\.2)- The harmonic measure Vy{Sy{x^ s)) is 
equal to v{Si[y~^x, s)), and this is at most e < i as long as s ^ /Cgj by Proposition 12. 121 We now 
verify this last inequality. By the definition of the Gromov product, 

d{x, y) - d{l,x) = d{l,y) ~ {x ■ y)i. 

As d{l,y) ^ 0, and {x-y)i < L{n+1) this implies that s ^ L — K3. As we have chosen L > K^ + K^, 
this implies that s ^ Ks, as required. 

jS]) Suppose that y G Xn- We wish to show that G\Xn-i is contained in a shadow with basepoint 
y, with a lower bound on the size of its r-parameter, which gives an upper bound on the harmonic 
measure of the shadow. We have chosen L such that L{n — 1) ^ K^, and d{l, x) ^ L{n — 1) + 2K5, 
so by Lemma 12. 8[ 

G\Xn-i = G\Si{x,L{n-l))c S,il,r), 

where r = d(l, x) — L(n — 1) — K^. The argument is now essentially the same as in case (4), except 
with 1 and x interchanged. Let y £ Xn, so {x ■ y)i ^ Ln. As L ^ + K^, we may apply Lemma 
12.71 which implies that S'a;(l,r) C S'y(l,s), where 

s = d{l,y) - L{n - 1) - K5 - K^. 

We now wish to use Proposition 12.12] to find an upper bound for h'y{Sy{l, s)) which is equal to 
i'{Si{y~^ , s)). By thin triangles and the definition of the Gromov product, d{l,y) ^ {x-y)i —26, so 

d{l,y) ^ L{n - I) + L - 25, 

which we may rewrite as 

d{l,y) - L{n - 1) - K4 - K5 ^ L-K^- K5 - 26, 

where the left hand side is equal to s. As wc have chosen L ^ K4 + + Ks + 26, this shows 
that s > Ks- Therefore Proposition 12.121 implies that iy{Si{y^^ , s)) ^ e, so i'y{Xn-i) ^ e < 5, as 
required. □ 

3 Linear progress 

In this section we prove Theorem 11.21 i.e. we show that sample paths make linear progress at some 
rate L, and furthermore, the proportion of sample paths at time n which are distance at most Ln 
from 1 decays exponentially in n. As d{l,g) is equal to d(l,g~^) the reflected random walk also 
makes linear progress at the same rate L, and with the same exponential decay constant for the 
proportion of sample paths distance less than Ln from the origin 1. 

A random walk of length nk, determined by a probability distribution /i, may be thought of as 
a random walk of length n, determined by the probability distribution fik- We shall write for 
Wkni and wc shall call this the k-iterated random walk. The steps of the fc- iterated random walk 
are ~ .. .Sik, and so for each i, the segment of the random walk from wf to w^^^ is 

independently and identically distributed according to the probability distribution /ife, the fc-fold 
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convolution of fi. However, the distance from 1 to wf,^^ most + d{wf , wf_^_i) , but may 

be smaller, as the random walk may have "backtracking," i.e. the geodesic from wf to wf+i may 
fellow travel with a terminal segment of the geodesic from 1 to iff . This is illustrated schematically 
in Figure m for the first few steps of the /c-iterated random walk. 

Set to be the random variable corresponding to the change in distance from the basepoint 1 
from time i — 1 to time i of the fc-iterated random walk, i.e. 

X^^d{l,w'i)~d{l,wl,), 

which may be negative. The sum of the first n random variables Xj^ is equal to the distance travelled 
at the n-th step of the fc-iterated walk, i.e. 

n 
i=l 

We may write Xf = Y^'^ — , where Y^^ is the distance the fc- iterated random walk travels between 
steps i — 1 and i, i.e. 

and Zf = Fj'^ — Xj'. The Y^ form an independent collection of random variables, but the do 
not. By the definition of the Gromov product, 

Zf = 2(l.iffL^^, 

and we may think of as the amount of backtracking the iterated random walk does from step 
i — 1 to step i. In particular, the Z^ are non-negative. In order to find lower bound estimates for the 
sums of the Xi, it suffices to find lower bound estimates for the sums of the Yi, and upper bound 
estimates for the sums of the Zi, and we now show how to do this, using standard results from the 
theory of concentration of measures. 

The distances Y/' = d{w'[_i,w'^) form a sequence of independent, identically distributed random 
variables, so estimates on the behaviour of the sums of these random variables are well known. Let 
Y'^ be the expected value of Y^'', which depends on k, but not on i. As the trajectories of the 
random walk converge to the boundary almost surely, Y^ — > co as fc — oo. We will use the following 
Bernstein or Chernoff-Hocffding estimate, see for example Dubhashi and Panconesi jDP09[ Theorem 
1.1] which says that the probability that the sum of n copies of Y^^ deviates from the expected mean 
nY'' by at least en decays exponentially in n. 

Theorem 3.1. Let Yi be a sequence of bounded independent identically distributed random variables 
with mean Y . Then for any e > there is a constant c < 1 such that 



We now show a similar bound for the sums of the Z^. We start by showing that the distribution 
functions of the Z^ are bounded above by the same exponential function, for all k and i. Furthermore, 
the upper bound for holds independently of the values of Z!j for j < i. As is a function of 
Wj for j ^ i, it suffices to show that the upper bound is independent of the values of Wj for j < i. 
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Proposition 3.2. There are constants K and c < \, which do not depend on k or i, such that 

Proof. By the definition of , if Z^ ^ r then (1 • w^)^k ^ ^ ir. By the definition of shadows, this 
condition is equivalent to the condition 6 S^k ^(1, ^r), therefore 

P(Z,f ^t\wI..., wti) = P (w^ e S^k_^ (1, \r) \w\,..., wU) ■ 
We may apply the isonietry {w^_^)~^ , and use the fact that {w^_^)~^w^ = s^, to obtain, 

P(Zf ^ r I wl . . . = P {4 e S,{{wU)-\ ir) | . . . , wU) . 

As s\ is distributed as /i^, and is independent of the for j < i, this implies. 

Now using Lemma 12.101 there are constants and c < 1 such that the bound fik{Si{g, ^r)) 5C 
K^d^^'^, is independent of g and fc, so this implies 

V{Z^^r\w1,...,wU)^Ksc'^^ 

as required. □ 

In particular, this gives an upper bound for the expected value of Z^ which is independent of k. 
Therefore, by choosing k to be large, we can make the expected value of Y/' much larger than the 
expected value of Z^. 

We now show that there is a constant L > 0, which is independent of fc, such that the probability 
that the sum Zi + ■ ■ ■ + Z^^ is larger than Ln decays exponentially in n. 

Lemma 3.3. Let he the k-iterated random walk of length n, generated by a finitely supported 
probability distribution fj, which generated a non- elementary subgroup of the mapping class group G, 
and let Z^ ~ 2(1 • w'^)^k ^. Then there are constants L,K and c < 1, which depend on fi but are 
independent of k, such that 

P(Zf + • ■ • + ^ Ln) s; /vc", 

for all n. 

Proof. We have shown that the probability that Z^ ^ r decays exponentially in r, with exponential 
decay constants which do not depend on either fc or i, or the values of any other Zj for j < i. 
As Z!^ is also independent of Zj for j > i, this implies that the exponential bounds for hold 
independently of the vales of Zj for all j ^ i. Therefore, the probability distribution of the sum 
+ ■ • ■ + Z^' will be bounded above by a multiple i^" of the n-fold convolution of the exponential 
distribution function with itself. We will use the following Chernoff-Hoeffding bound for sums of 
exponential random variables. The version stated below is an exercise from Dubhashi and Panconesi 
[DP09) ■ but we provide a proof in Appendix \K\ for completeness. 



17 



Proposition 3.4. [DP09[ Problem 1.10]. Let Ai be independent identically distributed exponential 
random variables, with expected value A. Then for any t ^ 0, 

P(Ai + • • • + A„ > (1 + t)nA) ^ 

The upper bound for the sum of the will therefore be times the upper bound above, i.e. 

P{Z^ + --- + Z^^^{1+ t)nZ^) {^^K 

The expected value Z'^ is bounded above for all k, so by choosing t sufficiently large, we may ensure 
that the base of the exponent on the right is strictly less that 1. This completes the proof of Lemma 

1331 □ 

We now complete the proof of Theorem 11.21 Recall that d{l,w^) = + ■ • ■ + X!^, and X^ = 
Y,^ - Z\. So if 

n 

Y,X\ ^n{Y^ -e-(\+t)Z^), 

then ^ VY^ — nY^ < -en, or H V Z^ ^ (1 + t)nZ^ , though of course both conditions 

may be satisfied. Furthermore, we may choose fc sufficiently large such that L = Y'^ — e—{l+t)Z'' is 
positive. The probability that at least one of the events occurs is at most the sum of the probability 
that either occurs, so 

F{d{l,w^) ^ Ln) ^ c'^ + Kc'^, 

for some constants ci < 1 from Theorem 13.11 and C2 < 1 from Lcmma l3.31 and this decays exponen- 
tially in n, as required. This completes the proof of Theorem 11.21 





4 Translation length 

In this section we prove Theorem ll.il We start by showing that translation length of g is coarsely 
equivalent to the length of the (relative) shortest element in the conjugacy class of g, which we shall 
denote [g], i.e. 

[g] = inf dn,hgh-^). 

heG 

Lemma 4.1. Let G be the mapping class group of a non-sporadic surface. There is a constant K 
such that \T{g) — [g] \ ^ K. 

Proof. Let g be a conjugate of minimal relative length [g]. By the definition of translation length, 
T{g) ^ d{l,g) ~ [g]. We now show the bound in the other direction. 

There is a constant M, which depends only on the surface, such that every non-pseudo-Anosov 
element is conjugate to an element of relative length at most M, see for example |Mahlll Lemma 
5.5] so we shall choose K > M and then we may assume that g is pseudo-Anosov. Let a be a 
quasi-axis for g, i.e. a bi-infinite quasigeodesic such that a and g"a are 25-fellow travellers for all 
n. Let h he a closest point on a to 1, then gh is distance at most T{g) + K from a. This implies 
that the distance from h to gh is at most T{g) + K, so d{l, h~^gh) is at most T{g) + K. Therefore 
[g] ^ T{g) + K, as required. □ 
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The mapping class group has relative conjugacy bounds, [Mahlll Theorem 3.1], i.e. there is a 
constant K, which only depends on the sm'face, such that if a and b are conjugate, then a = vbv~^ 
for some element v with 

d{l,v) s$ K{d{l,a) + dil,b)). 

If is a group element, then we may think of i? as a point in the metric space (G, d). However, we 
can also represent g by a choice of geodesic in G from 1 to g. Geodesies need not be unique, but 
any two distinct choices of geodesies are Hausdorff distance at most 26 apart. This gives two ways 
of representing a product gh of two group elements g and h. We may choose a single geodesic from 
1 to gh, or alternatively choose a path from 1 to gh consisting of two geodesic segments, the first 
consisting of a geodesic from 1 to g, and the second consisting of a geodesic from g to gh, which is 
the translate of a geodesic from 1 to h. Therefore if g is equal to vsv~^, we can represent <? by a 
path composed of three geodesic segments, each consisting of a translate of v, s and respectively, 
and this is what we mean when we refer to the path vsv^^. The fact that G has relative conjugacy 
bounds implies that if an element g is conjugate to a short element s, and is a conjugating word 
of shortest possible relative length, then the path vsv~-^ is quasigeodesic, where the quasigeodesic 
constants depend on d{l,s), the constant of hyperbolicity S, and the relative conjugacy bounds 
constant K. 

Lemma 4.2. [Mahlll Lemma 4.2] Let G be a weakly relatively hyperbolic group with relative conju- 
gacy bounds. Let g be an element of G which is conjugate to an element s, i.e. g = vsv~^ , for some 
V G G. If we choose v to be a conjugating word of shortest relative length, then the word vsv~^ is 
quasi-geodesic in the relative metric, with quasi-geodesic constants which depend only on the relative 
length of s, and the group constants 6 and K. 



Si{v, v) - K) Sgigv, d(l, v) - K) 













a 



Figure 8: A quasigeodesic path 

Proposition 4.3. For any constant T there is a constant Kg, which only depends on T, the constant 
of hyperbolicity S, and the relative conjugacy bounds constant, such that if g is conjugate to an element 
s of relative length at most T , then g = vsv^^ for some v with the following properties: 

L dil,v) ^ ^d{l,g)^Kg 

2. geSi{v,d{l,v)-Kg) 

3. leSgigv,d{l,v)~Kg) 
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Proof. Let g = vsv~^, where d{l, s) and w is a conjugating element of shortest (relative) length. 
The first inequality follows from the triangle inequality, which implies that d{l, v) ^ ^d{l,g) — T/2. 
As the path vsv~^ is a quasigcodesic, there is a constant L, which only depends on T, the constant 
of hyperbolicity i5, and the conjugacy bounds constant, such that the distance from v to a geodesic 
from 1 to g is at most L. This implies that if p is the nearest point projection of t; to a geodesic from 
1 to g, then ^ c?(l, v) — L. As any geodesic from ?; to g is contained in a /Ci-neighbourhood 

of the nearest point projection path, consisting of a geodesic from v to p, and then from p to g, 
where Ki only depends on S. This implies that the distance from 1 to any geodesic from d to g is 
at least d{l,v) — L — Ki. Finally, as the Gromov product (v ■ g)i is equal to the distance from 1 
to a geodesic from v to up to bounded additive error 26, this implies that {v ■ g)i ^ c?(l, v) — K, 
where K = L + Ki + 26, which only depends on the constant of hyperbolicity 6. This means 
that g G Si{v,d{l,v) — K), and the same argument applied to the points \,g and vs implies that 
1 G Sg{gv,d{l,v) — K), as vs = gv, for the same constant K. We may therefore choose Kg to be 
the maximum of K and T/2. □ 

Proposition 14.31 above shows that the probability that a random walk Wn is conjugate to an 
element of relative length at most T, is bounded above by the probability that there is an clement 
V, with d{l, v) ^ ^d{l, Wn) — K9, such that Wn £ Si{v, d{l, v) ~ Kg), and w^^ G Si{v, d{l, v) — Kg). 
We shall write A"„ for the measure corresponding to the distribution of pairs (w„, ^,7^) on G x G, 
i.e. 

XniU)^¥{{wn,w-^) G U), 

for any subset U C G x G. As F{d{l,Wn)) ^ Ln decays exponentially, by Theorem 11.21 this gives 
the following upper bound for that the probability that Wn is conjugate to an element of length at 
most T, 

P{T{wn) T) s; X„(5i(A, \Ln - Kg) + 0(c"), 

for some constant c < 1, and where A is the diagonal in G x G. 

Therefore, in order to complete the proof of Theorem 1 it suffices to show: 

Lemma 4.4. Let L he a constant such that '¥{d{\,Wn) ^ Ln) decays exponentially in n. Then for 
any K , there is a constant c < 1, which depends on K and fi, such that 

X„(^i(A, iin-i^)) ^0(c"). 

The rest of this section is devoted to the proof of Lemma 14.41 In fact, it will be convenient 
to obtain upper bounds for A"2„ rather than X„. This suffices to obtain upper bounds for Xn for 
all n, as if D is the diameter of the support of n, then X2n-i{Si{U,r)) < X2n{Si{U,r — D)), by 
Proposition [2?5] 

We start by showing that it is very likely that a random walk W2n lies in the shadow Si{wn, ■^d(l, Wn))- 

Proposition 4.5. The probability that w^n Hs-s in Si{wn,\d{l,Wn)) tends to one exponentially 
quickly as n tends to infinity, i.e. 

for some c < 1. 
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Proof. We shall find an upper bound for the probability that W2n does not lie in the shadow 
Si{wn, 5^(1, Wn))- Conditioning on w„ ~ g, and using the fact that the complement of the shadow 
Si{wn, ^d{l,Wn)) is contained in 5"^,^ (1, w„) — K^), Lemma \TM gives 

F{w2n Si{wn, ^d{l,Wn)) ^ ^ /i„ (5)P(5S„+i . . . S2n G 5^(1, ^d{l,g) - K5) \ w„ = g). 

geG 

The condition gs„-|_i ... S2n G <5'g(l, .?)— i^s) is the same as s„+i...S2n £ Si{g^^ , ^d{l, g) — K^), 
and as the Sn+i, ■ ■ ■ , S2n are independent of Wn, this implies that 

PK„ ^^i(w„,id(l,u;„)) ^ ^Mn(5)M«(^i(3"\^d(l,g)-X5)). 

see 

By Theorem 11.21 the probability that (i(l, w„) ^ Ln is at most 0(c"), for some ci < 1, which gives 

nW2n(^Si{Wn,\d{l,Wn))^0{dl)+ ^ (^K (5i (r?" \ id(l , 5) - i^s)) • 

geG\S(l,Ln) 

The upper bound for the measure of a shadow, Lemma l2.10[ then gives the following upper bound, 

geG\B(l,Ln) 

for some constant C2 < 1. Therefore 

P(^«2„ ^ id(l,u;„)) sc: 0(cr) + 0(C2"/'), 

which decays exponentially in n, as required. □ 

Applying this result to the reflected random walk implies that the probability that iWjri does not 
lie in S'i(u'^„^z/;„, ^c?(l, w^^^Wn)) also decays exponentially. 

We now use this to find an upper bound for X2 ji ill terms of ji^i x jin- 

Proposition 4.6. Let T he a subset of G x G. There are constants L > and c < 1 such that 

X2n{Si{T,r)) /i„ X MSi{T,mm{r, \Ln} - 25)) + 0(c"). 

Proof. We have shown that the probability that each of the following four events occurs tends to 
one exponentially quickly. 

c?(l, Wn) ^ Ln 
d{l,W2nWn) ^ Ln 

W2n e S'l(w„, ^d{l,Wn)) 

W2n e Siiw^^Wn, id(l,W^,>„)) 

Therefore the probability that all four of them occur tends to one exponentially quickly. 

If all four events occur, then {wn ■ W2n)i ^ ^d{l,'Wn) ^ ^Ln, and similarly, (w^^^Wn ■ ^ 
^d{l,W2n'Wn) ^ ^Ln. Furthermore, if (i/;2„,w^„^) lies in Sr{T), then there is a point (s,i) G T 
such that {w2n -5)1 ^ r and (w2n^ • t)i ^ r. This implies that • s)i min{r, ^Ln} — 26, and 
{w2nWn ' t)i ^ min{r, ^Ln} — 2(5, and so (wn, ui^^^Wn) G S'i(T, min{r, ^in} — 2(5), as required. □ 
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Finally, we now show that the /i„ x ^„-nieasure of a shadow of the diagonal S'i(A,r) decays 
exponentially in r. 

Proposition 4.7. There are constants ci < 1 and C2 < 1 such that 



for all n and r. 

Proof. Let w„ and w„ be random walks determined by /i and Jl respectively. If {vn,Wn) € 5'i(A,r), 
then there is a point x such that (w„ ■ x)i ^ r and (w„ - a;)! ^ r. Therefore (w„ • Wn)i ^ r — 26, 
and so u„ S S'i(w„,r — 25). By the upper bound for measures of shadows, Lemma |2.10[ for any Wn 
with d{l,Wn) ^ Kg, the probability that w„ G 5*1 r — 26) is at most iiTyc^"^'^, for some ci < 1. 
Furthermore, by Theorem ll.2[ there is a C2 < 1 such that the probability that d{l,Wn) ^ Kq is at 
most KtC2, for n ^ Kq/L. Therefore /i„ x /i„(S'i(A, r)) ^ 0(c5^) + ©(c^), as required. □ 

Combining Propositions 14.61 and 14.71 establishes Lemma 14. 4[ and so completes the proof of The- 
orem (TTTJ 

A ChernofF-Hoeffding bounds for exponential random vari- 
ables 

In this section we provide the details for the following ChernofF-Hoeffding bound for exponential 
random variables. This proof is the solution given by Dubhashi and Panconesi to |DP091 Problem 
1.10], which appeared in the initial draft version, but not in the final published version. 

Proposition A.l. Let Zi be independent identically distributed exponential random variables, with 
expected value Z . Then for any t ^ 0, 



Proof. Let have probability density function f{x) = ae so the expected value of Zi is Z = 
1/a, and set Sn = Zi + ■ ■ ■ + Z^. Consider the moment generating function 



/i„ x/I„(5i(A,r)) ^OK-) + 0(cS), 





for < A < a. Therefore 




It now follows from Markov's inequality that 



^S)^ ^^^^ = - 




The right hand side above is minimized by choosing A = a — n/s, which gives 
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Setting s = (1 + t)nZ, and using the fact that Z = 1/a, yields, 

V[Sn > {l + t)nZ) ' "^^^ 
as required. □ 

References 

[BHM08] S. Blachcrc, P. Hai'ssinsky, and P. Mathicu, Harmonic measures versus quasiconformal measures for hyper- 
bolic groups (2008), available at arXiv : 0806^.^915 

[BH99] Martin R. Bridson and Andre Haefliger, Metric spaces of non-positive curvature, Grundlehren der Math- 
ematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 319, Springer- Verlag, 
Berlin, 1999. 

[CMIO] Danny Calegari and Joseph Maher, Statistics and compression of scl (2010), available at 
, arXiv : math / 1008 . 4952 

[DP09] Devdatt P. Dubhashi and Alessandro Panconesi, Concentration of measure for the analysis of randomized 
algorithms, Cambridge University Press, Cambridge, 2009. 

[FM] Benson Farb and Dan Margalit, A primer on mapping class groups, Princeton University Press. 

[KM96] Vadim A. Kaimanovich and Howard Masur, The Poisson boundary of the mapping class group, Invent. 
Math. 125 (1996), no. 2, 221-264. 

[Kla] E. Klarreich, The boundary at infinity of the curve complex and the relative TeichmiiUer space. 

[KowOS] E. Kowalski, The large sieve and its applications, Cambridge Tracts in Mathematics, vol. 175, Cambridge 
University Press, Cambridge, 2008. Arithmetic geometry, random walks and discrete groups. 

[LMll] Alex Lubotzky and C. Meiri, Sieve methods in group theory II. The Torelli group (2011), available at 
TarXlv: 1104.24501 

[MahlO] Joseph Maher, Linear progress in the complex of curves. Trans. Amer. Math. Soc. 362 (2010), no. 6, 
2963-2991. 

[Mahll] Joseph Maher, Random walks on the mapping class group, Duke Math. J. 156 (2011), no. 3, 429-468. 

[MSll] Justin Malestcin and Juan Souto, On genericity of pseudo-Anosovs in the Torelli group (2011), available 
at arXiv: 1102.0601 

[MM99] Howard A. Masur and Yair N. Minsky, Geometry of the complex of curves. I. Hyperbolicity, Invent. Math. 
138 (1999), no. 1, 103-149. 

[MM86] Darryl McCuUough and Andy Miller, The genus 2 Torelli group is not finitely generated. Topology Appl. 
22 (1986), no. 1, 43-49. 

[RivOS] Igor Rivin, Walks on groups, counting reducible matrices, polynomials, and surface and free group auto- 
morphisms, Duke Math. J. 142 (2008), no. 2, 353-379. 

[R,iv09] Igor R,ivin, Walks on graphs and lattices — effective bounds and applications. Forum Math. 21 (2009), no. 4, 
673-685. 

[WoeOO] Wolfgang Wocss, Random walks on infinite graphs and groups, Cambridge Tracts in Mathematics, vol. 138, 
Cambridge University Press, Cambridge, 2000. 



23 



